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Abstract 

We investigate in this work the vahdity of hnear stochastic models for nonhnear dynami- 
cal systems. We exploit as our basic tool a previously proposed Rayleigh-Ritz approximation 
for the effective action of nonlinear dynamical systems started from random initial condi- 
tions. The present paper discusses only the case where the PDF-Ansatz employed in the 
variational calculation is "Markovian" , i.e. is determined completely by the present values 
of the moment-averages. In this case we show that the Rayleigh-Ritz effective action of the 
complete set of moment-functions that are employed in the closure has a quadratic part 
which is always formally an Onsager-Machlup action. Thus, subject to satisfaction of the 
requisite realizability conditions on the noise covariance, a linear Langevin model will exist 
which reproduces exactly the joint 2-time correlations of the moment-functions. We com- 
pare our method with the closely related formalism of principal oscillation patterns (POP), 
which, in the approach of C. Penland, is a method to derive such a linear Langevin model 
empirically from time-series data for the moment-functions. The predictive capability of 
the POP analysis, compared with the Rayleigh-Ritz result, is limited to the regime of small 
fluctuations around the most probable future pattern. Finally, we shall discuss a thermo- 
dynamics of statistical moments which should hold for all dynamical systems with stable 
invariant probability measures and which follows within the Rayleigh-Ritz formalism. 



1 Introduction 



We consider nonlinear dynamical systems governed by (possibly nonautonomous) differential 
equations: 

i = U(x,t). (1.1) 

We may take x = (xi, Xp)"'" to have any number of components, possibly infinitely many, 
formally including infinite-dimensional dynamical systems governed by partial differential equa- 
tions, etc. In many contexts the dynamics of a selected set of variables i/'(x) = (^i(x), ^„(x))''' 
is of interest. Of course, by the chain rule 

dt^ = (U • Vx)'0 := V. (1.2) 



When the dynamics is nonlinear, the righthand side V of equation ( |l.2D cannot generally be 
expressed in terms of the functions xjj themselves. For example, when U(x) and ■j/'(x) are 
polynomial functions of x, the righthand side consists of higher-degree polynomials. This is a 
manifestation of the closure problem of nonlinear dynamical systems. 

If one considers the initial- value problem with random initial data, a common strategem to 
obtain mean values m(t) := is to make a moment closure approximation ((U • Vx)''/')t ~ 
V(m, t) for some function V of the selected moments, so that a closed equation 

m(t)=V(m,t) (1.3) 

is obtained. Likewise, if one is interested in fluctuations of the variables V'(^)) then one can 
make an approximation that 

i{t)^\{^,t)+m (1.4) 

where q(t) is a random force of known statistics, which is supposed to represent the effects of 
neglected variables beyond the subset ■0 retained. Such a model — if it is valid — will clearly 
give important information about predictability of the variables ■j/'(t). For example, conditional 
probabilities P{tl^,t\'ipQ,to) are implied, which express precisely the limits on predicting the 



moment- variables at a future time t given their values at the present time to- One is thus 
interested to know the possibilities and limitations of such a stochastic modelization. 

In previous work [|^, |2| we have studied fluctuations of nonlinear dynamics by an action 
principle. Such an approach to fluctuation theory goes back to the work of Onsager and 
Machlup . They showed that a linear Langevin dynamics 

= A(t)^» + q(0, (1.5) 
in which q(t) is a Gaussian random force with zero mean and covariance 

{qm~'it')) = Qmt-t'), (1.6) 

can always be completely and equivalently reformulated in terms of an action functional T[ip]: 

r[^] = - / (it (^ - A^pyQ-H^l, - A^P). (1.7) 

4 J to 

The interpretation of this functional is as a "fluctuation potential" for time-histories. That is, 
the probability that a particular fluctuation value ip{t) occur for the random variable i/'(t) is 
given in terms of the Onsager-Machlup action by the exponential formula 

Prob (^>(t) ^(t) : -oo < t < +oo^ ~ e"^t^l. (1.8) 

This gives the most direct probabilistic significance of the Onsager-Machlup action. The fact 
that the action is a quadratic functional of is consistent with the fact that the solution of the 
linear Langevin equation is a normal random variable, with a Gaussian probability distribution. 

Although the Onsager-Machlup theory as originally developed was restricted to linear 
Langevin dynamics, the action method is completely general: For any statistical dynamical 
system and for any selected subset of random variables i/', an effective action T[tp] can be in- 
troduced which plays the same role as the Onsager-Machlup action does for linear Langevin 
dynamics. It has too an interpretation as a fluctuation potential Q for the empirical average 



^The additional factor of A'^ in the exponent in (1.9) is discussed more below. See Eqs. (2.44)-(2.46) 
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over N independent samples i.e. 

Prob ^ ip{t) : -oo < t < +00^ ~ exp (-7V • r[-0]) . (1.9) 

The effective action is also a generating functional for all (irreducible) multi-time correlations 
of the variables ip{t), of arbitrary order, and thus completely characterizes the distribution of 
those variables. To be precise, if the effective action is expanded into a functional power series 
in 6ip{t) := ipit) - (^(t)), as 

r*[^] = Erf / dh--- I dtkT\';l.^ih,...,tk)5^,,ih)---5ij,,itk), (1.10) 

k=2 

then the coefficients are just the irreducible multi-time correlators |^]. The correlators with 
k > 3 would all be zero for a Gaussian process. We shall not review these subjects further here, 
since they have been thoroughly discussed elsewhere [||, 

In our earlier works, we developed a Rayleigh-Ritz approximation method by which the 
effective actions r[z] of any set of random variables Z may be calculated within a PDF-based 
moment-closure scheme. In particular, an approximate effective action may be obtained for ■?/», 
the moment-variables retained in the closure. We shall show here that such a Rayleigh-Ritz 
approximate effective action r=K['0] of the moment-variables themselves is not only a formal 
generalization of the Onsager-Machlup action, but is actually much more closely related. In 
fact, we shall show that the leading quadratic term in the Taylor expansion ( 1.1C| ) of r=K['j/>] is 



always precisely of the Onsager-Machlup form, when the PDF-^nsate employed in the Rayleigh- 
Ritz calculation is "Markovian". By the latter specification we denote PDF-Ansdtze which 
are completely determined by the present values which they assign to averages of the moment 
functions. Our result means that, for such a "Markovian" PDF-Ansatz, there is always formally 



a linear Langevin dynamics such as (lj_5) which gives predictions for the 2-time correlations 
{6rp{t)6rp {t')) that are the same as those given by the Rayleigh-Ritz effective action. In general, 
however, for higher-order correlations, the linear Langevin model and the Rayleigh-Ritz effective 
action will not yield the same predictions. 



It is our purpose here to present the derivation of the Unear Langevin model via the effective 
action method and to discuss its physical interpretation and limits of applicability. The effec- 
tive action provides a framework to derive not only the linear theory but also the higher-order 
statistics (higher order in terms of the size of the fluctuations or the order of the correlator). 
It thus provides a means to assess the size of the corrections to the linear description. On 
the other hand, the linear Langevin model gives always the leading-order contribution to the 
effective action and, therefore, many of the important features of the full Rayleigh-Ritz ap- 
proximation are essentially entirely determined by the linear equation. We shall in addition 
give a more intuitive derivation of the Langevin model within moment-closure methodology, 
but not using a systematic or formal scheme. While such a derivation provides no possibility 
to assess limitations of the linear description, nevertheless it provides insight into the physical 
assumptions involved. We shall also compare our method with the principal oscillation pattern 
(POP) analysis, which is a well-known method to extract linear stochastic models empirically 
from time series data Q. Finally, we shall conclude with some general discussion on the ther- 
modynamics of moment-averages for dynamical systems with stable statistics. In particular, 
we discuss the law of entropy increase and fluctuation-dissipation relations at the linear level. 

2 Rayleigh-Ritz Effective Action of Moment Variables 

(2.1) Reprise of the Rayleigh-Ritz Method 

The Rayleigh-Ritz approximation of the effective action is based upon a variational formulation 
of the moment closure scheme. This is just a variational formulation of the method of weighted 
residuals Q to solve dtV = — Vx*(U'P) := CV, the Liouville equation for the phase-space 
distribution V. The basic ingredients of a moment closure are (i) a set of moment functions 
■^(x, t) = ('^i(x, i), ^/;„(x, i)) and (ii) a PDF Ansatz 'P{x;in,t), conveniently parametrized 
by the mean values = (i/ji), i = 1, ■■■,n which it assigns to those functions. The variable 
t is included to denote an explicit time-dependence, i.e. any time-dependence other than the 
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implicit one through parameters a(t),m{t). The a-type parameters appear in the variational 
formulation of the closure, in which one incorporates all of the moment functions into a single 
linear combination, 

n 

A{x;a,t) = Y,c^iM^,t)- (2-1) 

i=0 

Note that the constant function tpo^x, t) = 1 must be included in the sum in order to satisfy the 
final-time condition ^(oo) = 1. The PDF Ansatz V{x;ni,t) is subject to an initial-condition 
that it match the considered initial distribution Vq for the problem, P(x;m, tg) = "Po) in the 
weighted-residual sense that the averages of the n moment-functions ip{t) must match. Then, 
it is not hard to show that the moment closure equation 

m(t) = V(m(t),t) (2.2) 

where V(m, t) := {{dt + C^)tl^{t))in,t, is the result of varying the action functional 

r[A,V]= dt[{A{t),V{t))-{A{t),CVm (2.3) 

Jto 

over the above Ansdtze for A{t),V{t), with variational parameters aQ{t), a{t),in(t). The Euler- 
Lagrange equations for m{t) are just ( p.2| ) while the equations for ao{t),cx.{t) have the unique 
solution ao{t) = l,a{t) = subject to the final-conditions. 

The Rayleigh-Ritz approximation to the effective action [z] of a set of random variables 
Z is obtained, in general, as the stationary point F*[z] = st.pt._4 pF[^, varied over A{t),V{t) 
of the above forms, subject to the additional constraints of unit overlap 

{Ait),V{t)) = l (2.4) 

and fixed expectation 

{Ait)M)V{t))=zit) (2.5) 

for each given history z(t), for all times t after the initial time to- Q 

^Recall that Z(i) is an observable in "Schrodinger picture" and that the only time-dependence is explicit. 
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We show here that the Rayleigh-Ritz approximation T^..[ip] to the effective action of the 
moment- variables themselves has, in general, a quadratic part which is just an Onsager-Machlup 
action, when closure is achieved within the framework outlined above. Other closure schemes 
are conceivable within the Rayleigh-Ritz formalism and may even better represent the physics 
in certain situations. A "Markovian" approximation has been made above, in assuming that 
the PDF Ansatz V{-;in,t) is parameterized by only the present value m(t) of the n moment- 
averages. This is by no means necessary. More generally, one may assume that 'P[-;m, t] is a 
functional of the entire past mean-history {m(s) : s < t} of the n moment-functions. In that 
case, the closure equation becomes 

m(t)=V[t;m], (2.6) 

in which V[t; m] is now also a functional over the past mean-history. It is not hard to show that 
an exact equation always exists of the form (|2.6| ) for a suitable choice of the functional V[t; m]. 
(E.g. see Q, Appendix Al.) Thus, a closure incorporating such history effects is likely to be 
more faithful to the physics, in general. Our work here does not discuss this more general case, 
but confines itself to the "Markovian" Ansatz. Although this is restrictive, it is nevertheless the 
case that most practical closures considered in the literature are of this type. It is also possible, 
even within this more restrictive "Markovian" framework, to include some history effects. This 
may be done, for example, by constructing a closure using not only the n moment-functions 
'0(t), but also the corresponding n velocity-functions^ 

Vit):=idt+C^Mt). (2.7) 

In this case, the closure equations become, instead of (|2.2| ), 

m(t) = V{t) 

V(t) = G(m(t),V(t),t), (2.8) 

^This definition generalizes that in Eq.(^!^) to the case with explicit time-dependence. 
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where G(m, V,t) := {{dt + C^)y{t)) ni,v,t is the average acceleration of the moment-functions 
within a P'DF-Ansatz depending jointly upon m, V. Such schemes may be continued indefinitely 
to higher-orders, e.g. the next stage would be to include a dependence jointly upon m, V, G 
in the FDF-Ansatz. All of the results of this work carry over to such closures in terms of 
higher-order time-derivatives, if one simply considers the enlarged set of moment-functions 
^ = V), (i/j, V, G), etc. A linear Langevin model will always formally exist within such 
closure schemes which will exactly reproduce the predictions of the Rayleigh-Ritz effective 
action for the 2-time statistics of the moment- variables ^ considered. 

(2.2) Rayleigh-Ritz Effective Action: Exact Expressions 

It is the main purpose of this paper to demonstrate the latter essential fact. We will begin by 
developing some exact expressions for r*['j/']. Substituting the given forms of A{t),V{t) into 
the action (|2.3|), one obtains 

r = J2 dtai{t)[m{t)-V^{m{t),t)]. (2.9) 
i=l •'^0 

The overlap constraint ( |2.4p may be incorporated by eliminating the coefficient ao(t), giving 

n 

Ait) = 1 + ^a,(t)(V^,(t) - m,(t)) (2.10) 
i=i 

With that choice, the fixed expectation constraint ( p.5[ ) becomes 

n 

V',(t) = rrnit) + J2 o^j{t)a,{t) (2.11) 
i=i 

where C{t) := {il^{t)il^ (t))t— m(t)m^(t) defines the covariance matrix of the moment-functions. 
Equation ( |2.11| ) is easy to invert, with the result that cx{t) = C^'^{t)['ip{t) — m(t)]. It is 
convenient to denote the inverse of the covariance matrix by T{t) := C^^(t). Substituting the 
above results for Q(t), the action becomes 

roo 

r,[i/>;m] = / dt [m(t) - V(m(t),t)]^r(t)[^(i) -m(t)]. (2.12) 

Jto 
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In this expression, all of the constraints have been properly incorporated, and the only remaining 
variational parameters are the m(t)-variables. A set of variational equations must be developed 
to determine these, derived from the stationarity condition of the action. 

Before carrying out this variation, however, it is useful to introduce some auxilliary quan- 
tities. Define, within the PDF Ansatz employed, a single-time cumulant-generating function 

F{h\m{t), t) := log(exp(h.^(t)))t (2.13) 

for the moment-functions ip{t), where ni(t) is the mean of the moment-function within the 
PDF Ansatz. That is, the partial derivatives 



(2.14) 

h=0 



are just the pth-order cumulants of the ip{t) within the Ansatz. The Legendre transform 

H{iJ,\m{t), t) := sup[/i-h - F(h|m(i), t)], (2.15) 

h 

a generalized entropy, is the generating function of irreducible correlation functions. That is, 

(2.16) 

u^n---u^i^ ^^^^^^ 
In particular, the relations hold that 



r<"(f) = A,(m(t),t), r(|)(t) = C5'(t) = |^(m(t),t). (2.17) 

The latter relation will prove crucial in what follows. 

It may be worthwhile to explain the intuitive significance of these single-time quantities 
before continuing with the development of the formulae for the action. They are all part 
of a general thermodynamics of moments. Thus, the entropy H is a form of Boltzmann's 
entropy, with his original sign convention, i.e. positive and convex. It is related to fluctuation 
probabilities of the empirical ensemble averages Va^IO •= J2n=i '^^ time t by 

V (^^(t) « /x| m(t), t) ~ e-N-H{n\m{t),t) ^ (2.18) 



(n) 

where the samples xj) (t) are all independently chosen from the ensemble V{-\m{t),t). In 
other words, H oc — logProb, which is Boltzmann's famous relation. Because we have defined 
probabilities with respect to the measure V{-\m{t),t), this quantity corresponds to what is 
in mathematics called the relative entropy. The latter is an entropy of probability measures 
analogous to Gibbs', but with respect to an arbitrary a priori measure V. Thus, 

HiQ\V):= J Qlog(l). (2.19) 

It is not hard to show that 

H{fi\ui{t),t) = min n{Q\V{-\-m{t),t)) . (2.20) 

S:('0(t)>S=/X 

This is a basic relation between the "thermodynamic" entropy H{-\in{t),t) and the "statistical 
mechanics" measure V{-\m(t),t). It is a form of the maximum entropy principle ^ A good 
mathematical reference is Q. As we shall see later, H^{t) := //(m(t)|m* {t),t) should satisfy the 
second law of thermodynamics, dH^{t)/dt < 0, when ni^:{t) is the predicted mean history of the 
moment functions and m(t) is any other solution of the closure equations sufficiently near m*(t). 
The derivatives of H also have thermodynamic significance. For example, the first derivatives 
h(/x|m*(t), i) := ^(/2|m*(t), t) are the thermodynamic forces which give the departure of the 
moments from the predicted means in^:{t). Note, therefore, that h{fj,\m^:{t),t) = if and 
only if /X = m^(t). On the other hand, the Legendre transform 

F{h\m{t),t) := sup[//-h - H{n\m{t),t)], (2.21) 

is a generalized free energy. It was defined already in ( |2.13| ) above via the logarithm of the 
"partition function" Z{h\in{t),t) := (exp(h-'0(t)))n^(j) j. 

With this background, let us return to our analysis of the Rayleigh-Ritz effective action. 



Equation ( 2.17 ) yields immediately a useful expression, complementary to ( 2.12 ): 



r*[V';m] = / dt 

Jto 



d dh 

-h(m(t),t) - — (m(t),t) - W(m(t),t) 



T 



m)-in{t)], (2.22) 



^This a indeed a maximum principle in terms of the usual entropies 5'(/i|m(t), t) — —H(ii\m(t),t), and 
5(Q|P) = -H{Q\V). 
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where we have defined the new vector by matrix multiplication: 



W(m(t),i) := r(m(t),i)V(m(t),t). 



(2.23) 



Indeed, it follows from the chain rule and ( 2.17| ) that 



|h«) = r(t)m«) + fw. 



(2.24) 



Because of symmetry of T, it follows that (r(i)m(t))^ = TnoJ{t)T{t). Thus, we may use the 
previous relation to write 



|m(() - V(()]Tr(() 



^h(,)-f«)-wd" 



(2.25) 



When substituted into (|2l^ ), the result is (|232|) . This is a more convenient form for variation. 



Indeed, setting = gives 



\ dt drrik 



(2.26) 



3 J ~ J 

where the relations ( p.l7| ),( |2.25 ) have again been employed. Simplifying, we obtain finally 



„_vrr.„-.r(f.|^).o. 



(2.27) 



This is the variational equation to determine m{t) for a given i/'(t). When it is employed to 
eliminate m(t) in ( |2.12| ) or (2.22), the result is r*['j/>], the final Rayleigh-Ritz approximation to 
the effective action of the moment- variables ■j/'(t) in the closure. 

One more transformation of the action is useful. We may write ( p.l2| ) as the sum of two 
terms: 

Y^[ijj] = - [-0 - m]^r['0 - m] + / dt V]^r[^-m]. (2.28) 

In the first term we integrate once by parts, while in the second we use (|2^ ). This yields: 



1 



m 



^dT aw /dW\ 
dt dt dm \ dm J 



(1/7 — m) 



(2.29) 



Up until this point, no approximation has been made except Rayleigh-Ritz. Equation ( p. 29)) is 
the most convenient form to calculate the quadratic part of the Rayleigh-Ritz action. 



11 



(2.3) Quadratic- Order Action and Linear Langevin Model 

We shall now calculate the quadratic part of the full Rayleigh-Ritz effective action [■?/>]. An 
important quantity which appears is the linear stability operator about a solution m(t) of the 
moment-closure equations rii = V(m, t), that is, 



dV 

A{t) := _(m(t),0. 
am 



(2.30) 



The subscript * shall be used hereafter to indicate that the substitution of the particular solution 
m*(t) for given initial data mo* has been made: thus, A*(t) := |^(m*(t), f). It is easy to relate 



to the linear stability operator. In fact from the definition of W in ( ^.23| ) it follows that 



dWi 
drrij 



(2.31) 



where T\.l{t) = g^(m(t),t) denotes the single-time 3rd-order irreducible correlation function 



within the PDF Ansatz. Using this function again. 



If. 



dV 



df' dt 

Then, by means of ( |2.31| ),( p.32 ), we can see that 

dT 



d^ ^dT dW /dW'^ 
dt dt dm V dm 



dt 



+ {m- 2V)-Vmr - FA - (FA) 



Furthermore, using rii,,, = V{m^:,t), it follows that 



dT, 



+ (m=, - 2V*)-VmF* 



-— -(v..v„,)r, = -|r. 



(2.32) 



(2.33) 



(2.34) 



dt ' ' dt 

The effective action to quadratic order in deviations 6xl){t) := tp{t) — m^(t) from the solution 
m,,(t) of the moment-equation is then found to be 



Ff [5^] 



1 



to 



dt {5il) - 5m) 



dt 



Fj; A.* 



(F* A:^) 



((5-0 — 6m). 



(2.35) 



In this expression, the quantity 6m{t) := m(t) — m*(t) is to be determined in terms of 



Sip{t) from the variational equation ( 2.27 ) linearized about the solution ip,{t) = m*(t). It is 



convenient to rewrite ( 2.27 ) as 

- V(^, t) + [V(0, t) - V(m, t)] + C 



dT /■dW\ 
dt V 9m J 



{tp — m) = 0. 



(2.36) 
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Using again (2.31), (2.32), it is then straightforward to linearize this equation, yielding the 



variational equation for 5m{t): 

(5^-A*(5-0)-2Q*r*((5^/)-(5m) = O, (2.37) 
where the definition has been introduced 

= — A=kC^ — CkAJ. (2.38) 

Note that := — A*C* is the Onsager matrix, in terms of which the hnearized closure equation 
may be written in force-flux form: Ji/j = — L^,5h. Then (|2.38| ) may be restated as 

Q, = ic,+L:, (2.39) 

with := * 2 * denoting the symmetric part of the matrix L*. 

One may now obtain a final form for ri^^[5i/j] by eliminating 6m{t) from ( |2.35| ) by means 
of (im) and by using the definition (|2.38D of Q=k. One obtains 



to\ 1 

rr^[<5^] = / dt{6ip- AJ-4,yQ-\5iP - AJtP). (2.40) 

4 J to 

This is the final result. One observes that it has the form of an Onsager-Machlup action. That 
is, the Rayleigh-Ritz result for Tl [6ip] is formally equivalent to the effective action that would 
be obtained for the solution of a linear Langevin model Sip_^{t) of the fluctuation variable 5'ip{t). 
To be precise, the model stochastic equation is 

5x1^+ = A,{t)6iP^ + qit) (2.41) 

where q(t) is a random force, white-noise in time, with zero mean and covariance 

(q(t)qT(t')) = 2Q.(t)5(t-t')- (2.42) 

Note, in this context, that equation ( p.39| ) is the time-dependent generalization of the fluctuation- 
dissipation relation (of the 1st type), connecting the noise covariance matrix Q^, and the sym- 
metric (dissipative) part of the Onsager matrix. For details, see Section 6. 
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To understand the significance of the linear Langevin model, we must recall some basic facts 
about the effective action itself. As noted earlier, the effective action is a generating functional 
for irreducible multi-time correlation functions. That is, the /cth-order irreducible correlator is 
given by 

rj).,^(ti,...,t,) = K]. (2.43) 



In particular, these coincide with functional Taylor coefficients in the series expansion ( 1.10| ). 



Furthermore, the irreducible correlators of order up to k determine all of the cumulants — or 
connected correlators — C^i^\.-^{ti, up to the same order k. ^ From these two facts we see 

that knowledge of the Taylor series of [ip] up to terms of degree k is equivalent to knowledge 
of the Rayleigh-Ritz predictions for all multi-time correlators up to order k. In particular, 
knowledge of the quadratic term in the effective action, Fl; [dijj], is equivalent to knowledge of 
all two-time correlators as predicted by Rayleigh-Ritz. Because knowledge of the linear model 
is equivalent to knowledge of that quadratic "Onsager-Machlup" part, the key conclusion that 
we draw is that the linear Langevin model is the unique such model to reproduce exactly all the 
two-time correlators predicted by Rayleigh-Ritz. However, for correlators of higher than second 
order, the two will in general disagree. 

A simple observation which underlines this last point is the following: the solution 5ip_^_{t) 
of the linear Langevin model is always a Gaussian random function, while the true fluctuation 
variable 5'ip{t) is in general non-Gaussian. Thus, although higher-order cumulants than second 
are zero for the Langevin solution S'ip_^_{t), they are generally non-zero for the true fluctuation 
variable 6ip{t). Of course, it is clear that for any random process dil){t) there is a Gaussian 
random process 6tp^{t) which has the same mean and variance (when those exist). In fact, 
there is only one such Gaussian process, in the sense that its distribution on the path-space of 
histories is uniquely determined. This result is sometimes called the Khinchin-Cramer theorem. 
One way to construct such a Gaussian process is via the Central Limit Theorem. 



"'For example, for fc = 2, C,-^i^{ti,t2) = (r ^)iii2 (ti, t2); for fc = 3, C^jj^ig (fi, t2, ^s) = 
Y^j^j^isI '^^^ I '^^^ I '^•'3 C'nji(ii>si)Ci2j2(t2,S2)Ci3^3(t3,S3)rjii2j3(si,S2,S3); etc. See [|]. 
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The connection of the hnear Langevin model to the Central Limit Theorem is quite deep. 
In fact, the precise empirical significance of the linear Langevin model is that its predictions 
should be valid for the normalized sum variable: 

1 ^ , ^ 

6i,^{t):=^Y.^^ (2-44) 

V „=i 

in the limit N ^ oo, where the sum is over N independent, identically distributed samples. To 
prove this fact, recall that the effective action is a fluctuation potential for the empirical average 
over N independent samples, in the sense of equation ( |l.9D . Now we consider the probability 
of a small fluctuation value differing from the ensemble- mean by terms of order O • That 

is, we consider fluctuations 

^(t) = m,(t) + ^, (2.45) 
V -/V 



for 6tp{t) = 0(1). Substituting ( p. 45 ) into ( |1.9| ) and employing the functional Taylor expansion 



( |1.10| ) of r [ip] , it is then straightforward to show that 

Prob ((5V'jv(i) ~ '^V'(i)) ~ exp (^-T^^^Stp] + O (^)) • (2-46) 

In the limit as ^ cxd we arrive at the stated result. It is clear that the distribution of (5'j/>^(t) 
is Gaussian in the limit. ^ Furthermore, r(^)[?/'] acts as the Onsager-Machlup action of the 
limiting Gaussian variable, hence described also by the equivalent linear Langevin model. 

It is extremely important to emphasize that the existence of such a linear Langevin model has 
only been formally established, and only for the Rayleigh-Ritz approximation r*['0]. In general, 
the Central Limit Theorem only guarantees that a Gaussian process should exist with the 
same mean and covariance and not necessarily a process obtained from a stochastic differential 
equation or, for that matter, even a Markov process. It is thus a very striking prediction of 
the Rayleigh-Ritz method with a "Markovian" Ansatz — and far from obviously true — that 
the 2-time correlations should be reproducible by such a linear Langevin model. Indeed, this 



prediction can fail in a very striking way: the noise covariance Q=i,(t) given formally by ( 2.38 ) 



^In fact, we have just repeated above one of the standard proofs in the hterature of the Central Limit Theorem! 
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may turn not to be nonnegative! Of course, nonnegativity is a fundamental requirement for any 
true covariance function. If it fails, then the "linear Langevin model" exists only in some formal 
sense and there is no actual stochastic process which realizes the model. Put another way, the 
Rayleigh-Ritz approximation might fail to satisfy the realizability properties requisite for 

any true effective action. The relevant realizability properties (positivity, unicity of minimizer, 
convexity) have been discussed at length elsewhere [||, |l^ . It is easy to see that these properties 
of T^lip] will hold, at least for ip{t) close to the mean history m*(t), if and only if r* [6tp] > 0, 
with strict inequality for all 6ilj{t) ^ 0. Furthermore, examination of the Onsager-Machlup 
action ( |2.40 ) shows that realizability of [(5-0] holds if and only if the formal noise covariance 



Q*(t) appearing in the Langevin model is positive. 

3 A Simple Example 

It is interesting to compare the linear Langevin model with the full nonlinear Rayleigh-Ritz. 
In general, this should allow one to assess the limitations of the Langevin model for any given 
problem, in particular, to assess quantitatively how large are the corrections to its predicted 
Gaussian statistics. To illustrate the comparison of the Langevin dynamics and full Rayleigh- 
Ritz, we will discuss here very briefly a 3-mode model already considered in [p!o| , pT| . This is 
a simple "one-step cascade" model of dissipative turbulent dynamics, originally introduced by 
Lorenz in 1960 []l2|| . The dynamics are just the Euler equations of a top, but stochastically 
driven and linearly damped. The three modes are x = {xi,X2,X3), of which the first is the 
driven, unstable mode, and the second two are stable, damped modes. More specifically, the 
equations of motion are given by 

with i,j,k a cyclic permutation of 1, 2, 3. Note that A1+A2 + A3 = for conservation of energy 
by the nonlinear terms, the damping constants are > 0, i = 1,2,3 and the random driving 
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forces are zero mean with covariance 



{Mt)f,{t')) = 2K,6{t-t'), 



(3.2) 



all Kj > 0, i = 1, 2, 3. A "chi-square" PDF Ansatzwas proposed for this system by Bayly, which 
leads to the quasinormal closure equations. For full details of the model and closure, we refer 
to |ll[|. Here we will simply remind the reader that the basic moments in the quasinormal 
closure are the three modal energies Ei = ^xf, i = 1, 2, 3 and the triple moment T = X1X2X3, 
which gives the energy transfer out of the unstable driven mode and into the stable, damped 
modes. In the notations of this paper, the closure dynamics is given by m = V(m) with 



m 



E2 

V ^ / 



& V(m) 



. (3.3) 



AiT - 2viEi + Ki 

A2T - 2U2E2 + K2 
A3T - 2U3E3 + K3 
\ 4(^1^2 £^3 + ^2^1 ^3 + ^3^1 ^2) - {lyi + 1^2 + J^3)T J 

Although very simple, this model and closure will illustrate several key features of our method. 
In addition to the key comparison of full Rayleigh-Ritz and linear Langevin model, it will allow 
us to discuss some important issues concerning realizability. 

The parameters of the linear Langevin model arising from the chi-square closure for the 3- 
mode system are easy to determine. The general form of the model is given in Eqs.( 2.41 ) (|2.42| ). 
The dynamical matrix A (here time-independent, since we consider only the statistical steady- 



state) is given by the linearization of the closure equation, A = or. 



-2ui 







-2l/2 








-2U3 



\ 



(3.4) 



Ai 
A2 
As 

\ A{A2E3 + A3E2) 4{AiE3 + AsEi) 4(Ai^2 + ^2^i) ~ {j^i + 1^2 + J^s) ) 
To obtain the matrix A,,, appearing in the Langevin model for a specific situation, the corre- 
sponding moments m^, satisfying the fixed-point condition V(m^,) = must be substituted. 
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The noise covariance Q can be calculated as the symmetric part of the Onsager matrix L, and 
the latter is derivable from L = —AC, once the matrix covariance C of the moment-functions 
■0 is known. The latter is provided by the PDF Ansatz, in this case Bayly's chi-square Ansatz. 
A simple calculation in that case gives 



/ 



2Ef + |r4/3 



1 7^4/3 



17^4/3 
17^4/3 



2^2 + 3^4/3 



1 7^4/3 



lrpA/3 

1 2^4/3 
2^2 ^ 3^4/3 



2EiT + ST^/^ 2E2T + 3r^/3 2E3T + 3r^/3 



V 



2EiT + ST^/^ 
2E2T + 3r5/3 
2E3T + ST^/^ 
8E1E2E3+ 

4{Ei +E2 + E3)T^I^ + 19r2 



(3.5) 



Again, the matrices d, and, thence, Q=f are obtained by substituting the fixed-point moment 
values m^< . It is worth emphasizing that V and C are the only statistical inputs required from 
the PDF Ansatz at the level of the linear Langevin model. If one is not interested to carry 
out a full nonlinear Rayleigh-Ritz calculation, then these are the only quantities that need be 
provided a priori to construct the linear model. 

For the steady-state dissipative cascade of the 3-mode dynamics it is quite easy to calculate 
both A* and Q*. We have done so numerically with the same choice of parameter values of 



the 3-mode model as in our earlier work |1C, 11 1. The results given to four decimal places are 

/ .... .... \ 



(3.6) 



-0.002 


0.000 


0.000 


2.000 


0.000 


-2.000 


0.000 


-1.000 


0.000 


0.000 


-2.000 


-1.000 


-2.001 


-0.996 


-0.996 


-2.001 



and 



3.385 


0.545 


0.545 


-6.86^ 


0.545 


0.246 


-0.796 


1.815 


0.545 


-0.796 


0.246 


1.815 


-6.868 


1.815 


1.815 


8.428 
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(3.7) 



The matrix A* can be easily checked to have all eigenvalues with negative real parts. This 
indicates that the closure fixed point m* is linearly stable. However, it turns out that the 
putative noise covariance Q* has eigenvalue spectrum 13.426,1.042,1.031 and —3.194. One of 
the eigenvalues is negative! Thus, there is a breakdown of realizability in the Langevin model 
for this dissipative cascade state. 

Such a breakdown is also known to occur frequently in applications of the POP method, 
which we shall discuss at some length in Section 5. For example, Penland in her fundamental 
work Q obtained negative eigenvalues for Q^, in a POP analysis of a different quadratically- 
nonlinear 3-mode system, the chaotic Lorenz model. Her interpretation of this realizability 
breakdown is that it was due to nonlinearities of the Lorenz model that could not be modelled 
as white-noise random forces. This may be true, but it is not necessarily an indication that the 



linear Langevin model fails completely, for all statistics of the system. In our earlier work [|^, 11 1 
we have pointed out that the chi-square Ansatz for our 3-mode system — despite its leading to a 
nonrealizable linear Langevin model — nevertheless produces very good quantitative predictions 
for several statistics. For example, E2,E3 and T are all predicted to within about 0.3% and 
only the value of Ei is badly underpredicted (by a factor of 3). Thus, simply labeling the model 
as "bad" because it leads to a nonpositive noise covariance Q* would be counterproductive, for 
good predictions would then be thrown out with the bad ones. What is needed are realizability 
diagnostics which are more focused and selective, that can help to pinpoint precisely which 
predictions are good and which bad. 



In [^, 11] we have proposed that such diagnostics in the statistical steady-state are provided 
by the effective potentials. For any dynamical variable Z(t) of the system, the effective potential 
V{z) is a fluctuation potential for the empirical time-average Zt := ^ Jq dtZ{t). That is, 



Prob (Zt « z) ~ exp(-T • V{z)), (3.. 



in the limit as T ^ cxd. Because the effective potential is a measure of likelihood of fluctuations 
in the very time-average used empirically to define the mean statistics, it is plausible that it 
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should be quite sensitive to the failure of the closure for individual variables. The effective 
potential can be obtained analytically via the time-extensive limit of the effective action 

l^(z):= hm ir[zT], (3.9) 

i — ^oo 1 

in which 

{z for < t < T 
(3.10) 
otherwise 

Thus, it is easy to adapt the Rayleigh-Ritz algorithm to calculate the effective potentials. In 



1 10, 11] we have applied the full nonlinear Rayleigh-Ritz algorithm in the 3- mode system using 
the chi-square Ansatz to calculate the effective potentials of modal energies Ei and E2 and of 
the triple moment T. It was found there that the potentials and Vr are positive and convex, 
satisfying realizability, whereas the potential Ve^ was negative and convex, i.e. realizability- 
violating. In this case, therefore, the effective potentials were — as conjectured — successful in 
discriminating the good predictions from the bad. 

Here we wish to calculate these same effective potentials, but using just the linear model 
rather than full Rayleigh-Ritz. In general, a linear Langevin dynamics such as (^]^) gives 
easily the joint effective potential of all variables ■?/> therein, via the time-extensive limit of the 



Onsager-Machlup action (L7). Thus, from ( 2.40| ) one obtains directly the quadratic term 



yi')(^) = ^K, : 5^5^, (3.11) 

with 6ip = tp — m* and K* := AJQ^^A,,,. Here the dynamical matrix A* and noise covariance 
Q* in the Langevin model are evaluated at the steady-state values m* of the moment- averages. 
The effective potential of any single one of the moment variables can then be obtained by 
minimizing over the others: 

Vj^\i;i)= mm Vj'^\'il^). (3.12) 



Since the joint effective potential ( p. 11 ) is a simple quadratic form, this minimization is easy 



to carry out. In fact, if K** is the minor matrix obtained from K^, by deleting the ith. row and 
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column and k* is the vector obtained by deleting the element fcf from the ith column, then 



with At** := A:** — (K")~^ : k*k* . These last formulae allow a direct computation of the effective 
potentials of moment-variables from the parameters appearing in the Langevin model. 

In Figs. 1-3 we have plotted the parabolic effective potentials Vei,Ve2 and Vp obtained in 
this manner from the linear Langevin model corresponding to the chi-square Ansatz. The plots 



nonlinear Rayleigh-Ritz algorithm. For comparison, we have plotted both pairs of potentials 
together in Figs. 1-3, the new ones using the linear Langevin model and and the earlier ones from 
full Rayleigh-Ritz. Two points deserve to be emphasized. First, the computational expense of 
the Langevin model calculation is considerably lower than the full Rayleigh-Ritz calculation. 
Each of the separate symbols on the Rayleigh-Ritz effective potential curves was obtained by 
solving numerically a fixed point problem, coming from a perturbed closure equation. On 
the other hand, the Langevin model calculation required the solution of just one fixed point 
problem, to determine the mean moment values at the bottom of the potentials. Those are all 
that are needed to calculate the curvatures and hence the quadratic potential curves via 
( |3.13| ). Thus, the number of fixed point symbols appearing in each of the Rayleigh-Ritz curves is 
a quantitative measure of the numerical superiority of the Langevin model calculation. Second, 
we see that the two calculation schemes lead to essentially equivalent results in this example, at 
least for fiuctuations up to 20% of the mean value. At least in this range, essentially the same 
predictions for fiuctuations are obtained for the linear model as for full Rayleigh-Ritz, and at 
greatly reduced expense. 

Of course, over a wider range of fiuctuations one should no longer expect that the two 
calculation schemes will agree. In general, the full Rayleigh-Ritz calculation should capture 
important non-Gaussian fiuctuation effects that are missed by the simpler Langevin model. 
In Figs. 4-5 are plotted over wider ranges the two realizable potentials in the 3-mode exam- 




(3.13) 



cover exactly the same range as those in [IC, 11], where the potentials were calculated by the full 
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pie, and Vr, calculated again both by the full Rayleigh-Ritz method and by the linear 
Langevin model. Clearly, for fluctuations 1-2 times the means, the full Rayleigh-Ritz calcu- 
lation yields non-parabolic potentials associated to non-Gaussian statistics. The range where 
the two calculations agree gives an a priori indication of the size of fluctuations for which the 
linear model may be trusted. In the case of we see that fluctuations ~ 40% of the mean are 
well-described by the linear model, while for Vr the percentage is ~ 60%. Of course, it is an 
important question, not just whether the Rayleigh-Ritz calculation gives different results, but 
whether it improves upon the predictions of the linear model. In |1C, ^] it was already shown 



that the Rayleigh-Ritz effective potentials Ve^ and Vr give quite good quantitative results for 
fluctuations over the smaller ranges plotted in Figs. 2-3. However, there the full Rayleigh-Ritz 
and linear Langevin models substantially agree. It is difficult to get accurate results for effec- 
tive potentials in the wider ranges directly from numerical simulation of the 3-mode dynamics, 
because of the increasing rarity of those large fluctuation events. Thus, we will not show di- 
rectly an improved agreement of the full Rayleigh-Ritz calculation for the effective potentials 
in the wider ranges. Nevertheless, one important observation can be made. Because > 
in every realization, there must be zero probability for events with E2 < 0. For this reason, 
the true effective potential Ve2 must blow up, i.e. diverge to positive infinity, as the negative 
values of E2 are approached. However, because the effective potential predicted by the linear 
Langevin model is a simple parabola, it will intersect the ordinate axis £'2 = at some finite 
value of V. It will thus predict some positive probability of seeing negative values £^2 < (as 
would be true, if the fluctuations 6E2 were indeed Gaussian random variables.) However, as 
can be seen from Fig. 4 the full Rayleigh-Ritz result for Ve2 is rising faster than the parabolic 
potential from the linear model as E2 I 0. This is the correct tendency, as indicated above, and 
represents a qualitative improvement of the full Rayleigh-Ritz calculation. In general, one may 
expect that the full Rayleigh-Ritz calculation will give a more refined result, because it uses 
more information both from the dynamics and from the PDF Ansatz than the Langevin model. 
Nevertheless, it is plausible to believe — again quite in general — that the quadratic part will 
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be the term which dominates the effective potential sufficiently close to the minimum. The 
only way to violate this expectation is to have k** = and F(V'i) = O [Stpf)- Barring such 

(2) 

cases of accidental degeneracy, one can see that the quadratic term will well approximate 
the full 14 sufficiently near to the minimum. In that regime, the linear Langevin model shall 
account for the main tendencies of the full theory. In particular, the Rayleigh-Ritz effective 
potential will satisfy necessary realizability conditions in the vicinity of the mean, when the 
Langevin model itself is realizability. Thus, realizability of the Rayleigh-Ritz effective potential 
is, close to the mean history, equivalent to the realizability for a linear Langevin model. It is 
thus particularly important to understand the physical hypotheses underlying the validity of 
such a model. 



4 A Physical Derivation of the Linear Langevin Equation 

We shall now explain how exactly the same linear Langevin model can be obtained from a 
more physically transparent argument. Indeed, we shall show that the previous result for the 
quadratic order action can recovered from a single physical hypothesis: It is a basic assumption 
of the PDF-based moment closure methodology that, to characterize a probability distribution 
in phase-space, it is enough to know the mean values it assigns to the moment functions. In that 
case, the distribution is assumed to be described with sufficient accuracy by the PDF Ansatz 
which yields the same mean values for those moment functions. Let {■\tp{s),s < t) denote the 
expectation over the conditioned ensemble given the past history {■0(s),s < t} of the moment 
variables before time t. Our basic assumption is that the PDF Ansatz can be employed as well 
as an approximation for such an ensemble conditioned on the past values. More specifically, we 
shall assume that the approximation is valid that 

{Yit)ms), s<t)^ -■= (4.1) 

This is just a mathematical restatement of the hypothesis. Indeed, the conditioned ensemble 
yields the expected values {il>(t)\tp{s), s < t) = tp(t) and P(-;ip{t),t) is thus the choice of the 
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PDF Ansatz which matches those expected values, We shall employ this hypothesis mainly 
for the regime of small fluctuations, where 

V{^(t),t) = V(m,(t), t) + A,{t)S^{t) + O ((^-0^) (4.2) 

We have set Sip{t) := ip{t) — m*(t), the fluctuation variable. 

We now consider the consequences of our hypothesis for the dynamics of the fluctuations. 
We may write, without any approximation, 

dMt) = V{t) = {Y{t)\ik{s), s<t)+ q(t), (4.3) 

where the above equation is simply an implicit definition of the quantity q(i): 

q{t) :=V(t)-(V(t)|-0(s),s<t). (4.4) 

It follows directly from this definition that (q(t)) = and that 

(^.(s)qT(t)) =0 (4.5) 

for all s < t. If we now invoke our hypothesis in ( [4.3[ ), then we see that 

dMt)^V{^{t),t)+m- (4.6) 

In other words, within the approximation considered, the random moment-functions ip{t) 

satisfy the same closure equations as the mean values m(t) = {'ip{t))t, but with an additional 

stochastic noise q(t) which is decorrelated from earlier values of the moment-functions. This 

is very similiar to the regression hypothesis made by Onsager, according to which fluctuations 

should decay on average according the same macroscopic equation obeyed by the means. It is 

clear that it is exactly at this point in the heuristic derivation that a "Markovian" approximation 

has been made. It was emphasized by Onsager and Machlup (|^], p. 1509) that the regression 

hypothesis is, for a Gaussian random process, actually equivalent to the Markov property. 

^Note that P{--if}(t),t) really means P(-;m,t)\ , i.e. the average over phase-space with respect to 

m=l/'{t) 

P{-\ra,t) is always taken first, and then, subsequently, the random variable tpit) is substituted for m. 
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In the regime of small fluctuations 6il}{t), we may derive a more specific formulation. There, 
to linear order accuracy, the equation following from the hypothesis is 

« A,(i)<5-0(t) + q(t). (4.7) 

Because of the linear relation, it is clear that consistency requires the force q(t) to be white-noise 
in time. Indeed, the equation (4^) can be solved explicitly, as 



J to 

where we have introduced the (retarded) matrix Greens function 



G*(t, to) := Texp 



AJr) dr 



to 



e{t-to). (4.C 



It then follows by substituting (5'j/'(s) from ( ^ ) into ( [4. 51 ) that 

rG,(s,r)(q(r)q"^(t))dr = (4.10) 
Jto 

for all s less than t. Differentiating with respect to s then gives 

(q(.)qT(t))=0 (4.11) 

for all s < t. Thus, we see that the force must be delta-correlated in time 0: 

{q{s)ifit)) = 2Q,it)6is-t). (4.12) 

The noise covariance function Q=i,(t) is uniquely determined if we assume, consistent with our 
hypothesis, that the fluctuation covariance C(t) := {6'tp{t)6tjj {t))t is the same as C*(t) given 
by the PDF Ansatz. In that case, the noise covariance is uniquely obtained from the relation 

2Q, = C, -A*C*-C*AJ. (4.13) 



In principle the matrix Q,(t) in (4.12) could be a differential operator with finite-degree polynomial depen- 



dence on dt- We make the simplest assumption, that Q,(t) is an ordinary matrix function 
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Needless to say, we have now arrived at exactly the same linear Langevin model that we 
obtained before from the Rayleigh-Ritz approximation with a "Markovian" Ansatz. The present 
derivation should make clearer the physical assumptions involved in that more formal derivation. 

The quadratic Onsager-Machlup term in the Rayleigh-Ritz effective action will dominate in 
the vicinity of the mean history, barring degenerate cases where the quadratic term vanishes. As 
seen earlier the realizability of the effective action in that region will be essentially equivalent to 
the realizability of the linear Langevin model ( p. 41 ). The latter property is really a consistency 



check on the validity of the physical hypotheses underlying the Langevin model, in particular 
the consistency of employing the PDF Ansatz for an ensemble conditioned on the past history. 
In general, this depends upon the particular situation considered. In particular, enough vari- 
ables must be included in the moment-closure that the Markovian assumption inherent in the 
approximation is justifiable. 

This is perhaps the proper place to remind the reader that if a vector Markov process 
jp{t) is divided into two subsets then, in general, the separate subprocesses 

and tp'{t) will not be Markov. Thus, if the Rayleigh-Ritz effective action is determined not 
for the complete set of moment-variables ip{t) but instead only for a subset ip{t), then it will 
not ordinarily have the Onsager-Machlup form. However, there are special cases in which this 
is true. For example, suppose that the PDF Ansatz is such that the two variable sets are 
uncor related at equal times: 

(^'(i)^^(t)) - = O. (4.14) 

Suppose also that the closure equation of the ignored set of moments m'(t) is independent of 
the retained set m(t), that is, 

m(t) = V'(m'(t),t), (4.15) 

where V is a function of m' alone. This would be realistic for cases like fluid turbulence with 
a passive scalar contaminant. In that case, the exact velocity dynamics is independent of the 
passive scalar. In cases where these two conditions hold, the Rayleigh-Ritz action r=K[i/'] would 
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still have a quadratic part of the Onsager-Machlup form. This is straightforward to show by 
arguments like those used before. Needless to say, the two conditions are quite restrictive. 



5 Relation to Principal Oscillation Pattern (POP) Analysis 

There is a very close relation of the foregoing theory with the principal oscillation pattern 
(POP) analysis, particularly as it was developed by C. Penland Q. In her approach, the POP 
method is a procedure to derive directly from the empirical time series data for a selected set 
of variables the linear Langevin dynamics whose stochastic solution has the same mean and 
covariance as those empirically derived, if such a Langevin model exists. Her method can be 
explained in terms of the equations used above. Indeed, assuming the validity of a Langevin 
equation such as ( |1.5[ ), it is easy to show that, for any t > t' , 

|c(t,0=A(t)C(t,0- (5.1) 
where C{t,t') := {ip{t)'il) (t')) is the empirical 2-time covariance matrix. Thus, the linear 



dynamical matrix A(t) can be obtained as 

C'Hf), (5.2) 



A(t)=|c(M') 



t'=t 



where, as before, C(t) := C{t,t). Once A(t) is known, the FDT relation analogous to (4.13) 
can be used to determine Q(t) from A(t) and C(t). This is essentially the procedure proposed 
by Penland to deduce the Langevin model from the data (with appropriate changes having been 
made to allow for the general case of time-dependent statistics considered here). 

Of course, such a Langevin model need not exist at all, as pointed out also by Penland. There 
are some basic consistency properties that must be satisfied, if this is to be possible. First, the 
computed noise covariance Q(t) must be positive-definite. This is the same type of realizability 
condition that was encountered in the Rayleigh-Ritz approach. It is a qualitative check of the 
Langevin model assumption, basically amounting to a statistical stability condition within that 
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framework. A more stringent and quantitative property for validity of a linear Langevin model 
is deduced by inverting (^), to obtain 

A{t) = jC{t,t')-C-\t,t'). (5.3) 

This must hold for all t' < t. In general, the righthand side of ( |5.3| ) defines an object A{t, t') 
for t' = t — T < t which will have a nontrivial dependence upon the time lag r. To be consistent 
with a linear Langevin model, however, there should be no such dependence. Hence, the degree 
of constancy of A(t, t') in the lag time r is a quantitative measure of the validity of the linear 
Langevin modeling assumption. This is Penland's "r-test" Q. 

Although the Rayleigh-Ritz and POP methods are seen to be closely related, they have 
almost opposite points of view. The Rayleigh-Ritz approach is an a priori theoretical method, 
whereas the POP approach is a posteriori and empirical. That is, the Rayleigh-Ritz method uses 
the underlying dynamical equations of motion computationally, in conjunction with physically- 
inspired guesses for the system statistics. Thus, it deduces the linear Langevin model without 
any direct empirical input (aside from experimental knowledge which may have been exploited to 
develop suitable PDF Ansdtze for the problem). On the other hand, the POP method makes no 
use of the dynamical equation of motion, and, indeed, could be applied to time series generated 
by very different means than a dynamical equation. POP is blind to theoretical considerations, 
except through the choice of relevant variables 'i/'(t) to be used in the analysis. Because the 
two approaches have such different philosophies but yet a close formal relationship, they should 
be quite complementary in assaulting a given problem. In both cases, a linear Langevin model 
is obtained which is supposed to reproduce faithfully all first- and second-order correlators of 
the selected set of variables. Thus, the Langevin model deduced theoretically by Rayleigh- 
Ritz may be compared directly with that deduced from the experimental data via POP. On 
the other hand, a successful application of the empirical POP procedure for a given set of 
variables — with "success" meaning here that realizability of the noise covariance is satisfied 
and that lag-dependence of the deduced matrix A(t, t') is weak — would imply the possibility of 
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carrying out a successful Rayleigh-Ritz approximation with the same set of variables. 

Although POP is an a posteriori method, relying upon a substantial empirical input, it has 
predictive power. This capability rests upon a basic hypothesis: that the POP Langevin model, 
while constructed only to reproduce partially the second-order statistics, may also be used to 
predict other statistical properties of the system with some accuracy. In particular, quantities 
such as transition probability densities P{tp,t\tpQ,tQ) can be deduced from the POP Langevin 
model. For many problems of weather and climate prediction, such probabilities would yield 
crucial information. For example, a measure of the spread of the predictions, such as Penland's 
relative discrepancy: 

can be estimated. This quantity is itself second-order, but not one used in the derivation of the 
POP Langevin model and thus not one that the model is guaranteed to predict successfully for 
arbitrary time lags t = t — t^. 

The Rayleigh-Ritz method has the potential for superior predictive ability, particularly with 
regard to non-Gaussian statistics and large fluctuations. As we have noted, the full Rayleigh- 
Ritz calculation predicts non-vanishing higher-order cumulants of the moment variables, as 
required for non-Gaussian statistics. Thus, when the statistics of the problem — such as the 
transition probabilities — have a very non-Gaussian form, the Rayleigh-Ritz approximation may 
still derive them successfully. Previous work on simple systems has already shown that very 
large fluctuations, far outside the Gaussian core, may be successfully captured by a Rayleigh- 



Ritz calculation. See the examples in |11|. Thus, the Rayleigh-Ritz method can yield crucial 
information about such large fluctuations, not available by a POP analysis. When the system 
is strongly fluctuating, and the most probable future event is only weakly selected, realizations 
deviating from that predicted event by percentages » (^(i,to) would have sizable probability. 
In that case, a Gaussian transition density, such as always yielded by a linear Langevin model, 
would yield very misleading estimates of event probabilities. The Rayleigh-Ritz method has 
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the potential to predict better the non-Gaussian probabilities of such large-deviation events. 



6 Fluctuation-Dissipation Relations 

A basic premise of our work is that the dynamical system considered is statistically stable, 
i.e. that the probability measures V{t) which solve the Liouville equation dtP{t) = CV{t) for 
all initial conditions Vq converge to a unique invariant measure Poo as t ^ oo. I Of course, 
such statistical stability is not precluded — indeed, is even assisted — by chaotic instability of 
the underlying micro dynamics. In this context, one expects that a generalized second law of 
thermodynamics should apply, appropriate to dissipative dynamical systems that are driven by 
external forces or open to the environment. In such a circumstance the usual thermodynamic 
entropy of the system proper obviously need not increase, but only the overall entropy of the 
system plus environment. An entropy function appropriate to describe the irreversible decay of 
the system to its stable, dissipative steady-state is provided by the relative entropy introduced 
in Section 2. Its production rate (or, rather, destruction rate, with our sign convention) is 
zero in the steady-state itself, and thus does not account for the dissipative processes occuring 
therein. The latter have been subtracted out, in the definition of the relative entropy. However, 
the relative or generalized entropy turns out to be the most useful concept in the dynamical 
description of the system proper, since it provides a Lyapunov functional for the irreversible 
decay to the statistical steady-state. Furthermore, the usual relations between random fluctu- 
ations and mean dissipation — the fluctuation- dissipation relations — are valid in terms of this 
generalized or "excess" entropy production within the Rayleigh-Ritz approximation, subject 
to satisfaction of realizability constraints. Such results for statistical steady-states, under hy- 



potheses paralleling those made here, are due originally to Schlogl [13|. In view of the generality 
of these results, it is appropriate to give here a brief account. 



®This remark applies to autonomous evolution only, in which the Liouville operator £, has no explicit time- 
dependence. 
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The generalized entropy relative to the predicted mean history m*(t) is defined by 

H,{ii,t) ■.= H{fi\m,{t),t), (6.1) 



where H on the righthand side is given by ( 2.15D in the text. The generalized entropy production 



or excess dissipation is then defined by ry*(/i, t) := ^H^{m{t'),t') ^ where m(-) is the solution 
of the closure equation which satisfies m(t) = ^i. Thus, a simple calculation gives 

dH 

=h,(At,t)-V(^,t) + -^(^,t). (6.2) 

It is not hard to show that, to quadratic order accuracy in small deviations, 

r/, := - {1.1 + ^C,^ : JMh + O (dh^) . (6.3) 

We shall sketch the proof below. First, however, let us recall the relation between the noise 
covariance Q*(t) and the Onsager matrix L*(t) = — A^,(t)C*(i), already given in ( 2.3S| ): 



Q, = L: + ^a. (6.4) 

This is the fluctuation- dissipation relation of the first type. Along with ( |6.3| ) it allows one to 
express the quadratic part of the entropy production (or dissipation) directly in terms of the 
noise covariance Q*: 

:= -Q* : <5Mh + O (dh^) . (6.5) 

Thus, the FDR of 1st type expresses a direct connection between the noise characteristics and 
the dissipative part of the linear dynamics. The generalized entropy /?*(/x,t) is a nonnegative, 
convex function, vanishing at m*(t). When the noise covariance Q*(t) is positive-definite, ( |6.5D 
implies that the generalized entropy is a Lyapunov function for the closure dynamics. At least 
for small deviations 6m{t) from the solution m*(t), where the linearized dynamics 6ih{t) = 
— L*(t)(5h(t) applies, the entropy H^:{t) is guaranteed to decrease in time, dH^{t)/dt < 0. This 
implies stability of the history m*(t) under the closure dynamics, generalizing the results of 
Schlogl |l^ to non-steady states of strongly fluctuating systems. 
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The proof of (|6.3| ) is as follows: Expanding the generalized entropy in a power series about 
m,,(t) yields, with (5//(t) := fi — m^<(t), 

H4fi,t) = ^T^t) : 5/x(t)5/x(t) + ^r?)(t);5/x(t)5/i(t)5/i(t) + O (5/) . (6.6) 

Recall that Ti^\t) := ^^(m*(t),t). Taking one derivative of ( |6.6[ ) with respect to fj, gives 

h, (/I, t) = r, (t) • 6fi{t) + irf ) (t) : 6ti{t)6tMit) + O (^^i^^ . (6.7) 

Introducing dh{t) := T^{t) ■ 5^{t), the latter becomes 

{fx,t)= 6hit) + ^rf ) (t) : 5ii{t)5tx{t) + O (V) • (6.8) 

A similiar Taylor expansion of the dynamical vector field V gives 

V(/x,t) = \,{t)+A,{t)5ii{t) + 0(5i?) 

= V* {t) - L, (t)<5h(t) + O (5h^) , (6.9) 

with V*(t) := V(m*(i), t). Now the Taylor expansion of the first part of the excess dissipation 
(the force-flux quadratic form) can be obtained by direct substitution of (|6.8D 



h,.V = (5h-V* - : 6h6h + ^vi^^ 15 fi5 fxY ^ + O ((J/i^) . (6.10) 

The second part of the entropy production is obtained by partial differentiation of ( |6.6D : 

^ = -Sh-Y, + l^(m,(t),t) : 6^,6^l+^^^{nl,{t),t)■:6^,6^l6^l + 0[^fl^) . (6.11) 

We made use of the facts that ^5^i{t) = ^(/x — va.^{t)) = —Y^:{t) and that, for every non- 
negative integer p, V^^^^\m^{t),t) = (m^,(t),t). Adding together the two parts of the 
entropy production from (|6.10| ),( |6.11| ) then gives 

1 /^(3) dV, 



77, = -L, : (5Mh + - [Vr • V* + ^ ) : 6^l6^l + 0[5h-'). (6.12) 

.(3) , dV 



If one recalls that ^T* = F* •V.,, -|- ^g^, as in ( ^.32 ), then we obtain finally 



-L, : 5h(5h + -f , : (5/x(5/x + O (dh^^ 



2 
32 



= + ^C, j : 6h6h + O [Sh^) , (6.13) 

where F^, = — r*C*r* was employed in the last line. This is just (|6.3|), as was claimed. 

There is another result, the fluctuation- dissipation relation of the second type, which holds 
for a general linear Langevin model. This relation expresses a proportionality between the 
mean response function to an appropriately coupled force and a time-derivative of the 2-time 
correlation function. The equation to be considered is 

5^ = -L,(t)(5h-f(t))+q(t) (6.14) 

where f (t) is a deterministic external force. Because of the linearity of this equation, it follows 
immediately that the corresponding response function H(t,to) := ^jt^ is non-random and its 
average H=K(t, to) := (H(t,to)) is thus given just by the solution of 

d 

— H,(t,to) = A,(t)H,(t,to) + L,(to)5(t-to)- (6.15) 
ot 

It is not hard to see that the latter solution is 

H,(t,to) = G*(t,to)L*(to), (6.16) 



with G* the matrix Greens function defined in ( [4.91) . On the other hand, by using the same 
Greens function to solve ( |5.1| ) for C^,(t,to) starting from time to, it is determined that 

C,(t, to) = G^{t, to)C*(to). (6.17) 

Because L^,{to) ■= —A^{to)C^,{to) and because ■^G^,{t,to) = — G*(t, to)A*(to) for t > to, it 
follows from (|6l6|),(|6T7D that 

J- [C,(t,to)r(to)] = il,{t,to)T{to) (6.18) 

for t > to- This proportionality is termed an FDR of 2nd type. The reader should be cautioned 
at this point that there is a great divergence of terminology in the literature. One finds often 
the following terms used instead: FDR of 1st kind to indicate what we called FDR of 2nd type 
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and FDR of 2nd kind to indicate our FDR of 1st type. There are also authors who call (6.17) 
the FDR of 2nd type (1st kind) rather than ( |6.18| )! For further discussion of these matters, see 
|14|, Sections 3.2, 4.1 and Appendix D. 

Another relation of the "2nd type" exists within the Rayleigh-Ritz formalism. This has 
a slightly different character, in that the external perturbation field is now added to the de- 
terministic equation. As the simplest example, consider the following perturbation of the 
moment-closure equations: 

rii = V(m, t) + C(m, t)-h(t), (6.19) 

in which C(m, t) is the model single-time covariance matrix provided by the PDF Ansatz. Then, 
if R(t, to) := shlfj) h corresponding response function, it is easy to see by functional 

differentiation that, for initial conditions m(to) = in*o in the above equation, R^, satisfies 

dtYi^{t,tQ) = A^{t)R^{t,to) + C^{to)5{t - to), (6.20) 

The solution is just R^<(t,to) = G*(t, to)C*(to) for t > to- Thus, we see by reference to (6.17) 
above and the symmetry of the covariance that 

a(t,to) =R*(t,to) +Rj(t,to), (6.21) 

where Rj(t, t') := [R=K(t', t)]^. The relation ( |6.21| ) might be better termed a fluctuation-response 
relation, in analogy to that of Kraichnan [^]. It turns out that this relation is completely 
general within the Rayleigh-Ritz method. In fact, equation ( |6.19| ) above is nothing more than 
the Euler-Lagrange equation for m(t) in the Rayleigh-Ritz algorithm, when the expectation 
constraint ( p.5| ) is incorporated via a Lagrange multiplier h(t). That is, equation ( 6.1S| ) above 



is equivalent to equation (3.93) in All of these statements remain true even when the 
random variables whose 2-time covariance is to be approximated by Rayleigh-Ritz are not the 
basic moment-variables appearing in the closure and the linear Langevin model is not available. 
The demonstration of this fact will be given elsewhere [l^, since it is outside the scope of the 
present work. The result (6.21) is very useful, because it provides the most efficient numerical 



procedure to extract the Rayleigh-Ritz predictions for the 2-time covariances. 
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7 Conclusions 



In this work we have shown how the general Rayleigh-Ritz algorithm for statistical dynamics 
of nonlinear systems, proposed in gives rise to linear Langevin models. Such stochastic 
models reproduce the predictions of the full Rayleigh-Ritz calculation for 2nd-order statistics. 
In general, for higher order statistics and larger fluctuations, the two methods yield different 
predictions. Thus, the Rayleigh-Ritz approach gives also a means to assess a priori the domain 
of validity of the linear Langevin models. A more physical derivation of Langevin models 
was also sketched. The theoretical and a priori Rayleigh-Ritz method was compared with 
the empirical and a posteriori POP method of Penland. Finally, some general results on the 
thermodynamics of statistical moments — law of entropy increase and fluctuation-dissipation 
relations at the linear level — were derived within the Rayleigh-Ritz formalism. 
Acknowledgements: I wish to thank Frank Alexander for many useful conversations on the 
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